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Abstract 

In our previous work, we have shown that many of the properties of the Florida power grid 
are reproduced by deterministic network growth models based on the minimization of energy 
dissipation E^ ss . As there is no a priori best E& i ss minimizing growth model, we here present a 
tool, called the “centrality fingerprint,” for probing the behavior of different growth models. The 
centrality fingerprints are comparisons of the current flow into/out of the network with the values 
of various centrality measures calculated at every step of the growth process. Finally, we discuss 
applications to the Maryland power grid. 
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I. BACKGROUND 


This work is concerned with the study of power grids using the language and methods 
of complex network theory. Since, in all their detail, today’s electrical distribution grids are 
the largest engineered systems ever built [lj], our coarse-grained approach focuses exclusively 
on high-voltage transmission lines, high-capacity generators, and switching and transmission 
substations. These three elements are represented by the edges and vertices of a complex 
network. Our primary test case is the real Florida power grid (FLG), whose N = 84 vertex 
network is depicted in Fig. [ljo. Such network models can be applied to test Intelligent 
Islanding strategies for limiting cascading blackouts j^. 


Previous publications on this topic { 2 , 3] have focused on variants of a Monte Carlo “cool¬ 
ing” model. I 11 that approach, power lines are randomly connected between loads and gen¬ 
erators distributed randomly over a rectangular geography. This results in an unrealistically 
long total line length L, which is reduced by a Metropolis line-switching process, in which the 
Hamiltonian is equal to L and the temperature is chosen to match the total length of lines 
in FLG. Thus, in the Monte Carlo power-grid models, both L and M —the total number of 
lines—are explicitly matched. Further refinements in 4] have captured other features of FLG 
at the cost of explicitly matching the total edge resistance R , where the resistance of the edge 
ij is equal to (geographical distance between i and j)/ (number of lines between i and j). 


In an effort to illuminate the architecture of power grids, we have more recently sought 
to produce models that coincide with FLG in several key metrics, but without explicit 
matching. To this end, we have introduced a family of deterministic growth models that start 
with a minimal-length spanning tree over the vertices and add lines one by one according 
to a fixed rule. The inspiration behind the rule comes from the behavior of general resistor 
networks with fixed current boundary conditions—currents flowing either in or out of the 
network at certain junctions. The problem is to solve for the internal currents flowing 
across the resistors of the network. In this situation, of all the current flows consistent with 
Kirchhoff’s Current Law, the one realized by nature is that which minimizes the total energy 
dissipation E diss = Yl e e edges O ne can apply this kind of dissipation optimization 

concept to any network that features in and out flows. In the present case—as in our 
previously published work—the AC power-distribution problem of the electrical grid is recast 
as a DC current-flow problem. The current flowing into the power grid network at a vertex 
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FIG. 1. Growth model based on real Florida vertex positions and in/out currents, (a) Illustration 
of the stopping criterion. The smallest difference (y 2 ) between the growth-driving currents and 
exponential centrality is found when the number of lines M = 208. See Fig. [2h for a generalized 
stopping criterion, (b) The FLG network. Arrowheads indicate direction of current flow along 
edges, while brighter arrow color indicates stronger currents. Thicker arrows indicate more parallel 
lines within multiedges. Red/blue vertex colors indicate high/low vertex potentials, (c) The grown 
network starting with the minimum spanning tree subgraph of the real Florida grid network and 
ending with M = 208 lines according to the stopping criterion. 


is proportional to the generating capacity of a corresponding power plant. Analogously, the 
current flowing out of the network at a vertex is the corresponding load power consumption. 
Naturally, the energy dissipation of the grid will be highly sensitive to the distribution of 
these currents, and since this is the motivation behind our growth model, we call these 
in/out current values the growth-driving currents. 


Choosing appropriate values for these currents is vital to the success of the model. Data 
for the generator capacities (the positive growth-driving currents) are generally available 
from power-plant management. On the other hand, we must resort to estimation for the 
load power consumptions (negative growth-driving currents). In making these estimates, our 
guiding assumption is that topologically important load vertices will have relatively higher 
out-currents. In the next section we sharpen the concept of topological “importance,” iden¬ 
tifying it with network centralities found in the literature js]. Any free parameters introduced 
in our centralities must be calibrated to match the (known) generator data. The best agree¬ 
ment is found using the exponential centrality at T ^ 2.4 (see Sec. ED- We note that the 
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growth-driving generator and load currents are normalized to 1 and —1, respectively. 

With the intuitively plausible E^iss optimization principle in hand, we may proceed to 
define various rules for choosing lines in the growth model. The best results (for the metrics 
in Table [I]) have been found with the “Cost/Benefit” model, in which the added line is 
chosen to minimize AL/\ AT^issI- Here, the length of the line is associated with the cost, 
while the benefit is the drop in -Ediss- A rival model captures the same idea by focusing on the 
total (rather than marginal) cost and benefit, minimizing E^ SS L 2 . Additionally, we explore 
power-grid networks that are not cost-constrained: we choose the line that minimizes Ediss 
at every step— this we call the “pure E ^ KK minimization” model. 

The only element that remains to be specified is a stopping criterion for the network 
growth. The results reported here correspond to stopping when the y 2 between the growth¬ 
driving currents and the exponential centrality measure (with the parameter T = 2.4) is 
minimal. This is described further in the the next section. The results of the growth model 
for real Florida generator/load geographies are reported in Fig. [0 Here, the starting point 
of the growth is the minimal-length vertex-spanning tree subgraph of FLG. The results for 
certain key metrics are listed in Table [U There, C is the clustering coefficient (b| and e values 
are the network mixing patterns—the fraction of power lines with ends on different vertex 
types{ 7 ]. We stress the lack of explicit matching for any of these metrics. 

TABLE I. Properties of the growth model for FLG. 

Network Ediss M # edges # multiedges C L/N R e gg e g i eu 

FLG 0.029 200 137 48 0.21 1.09 139. 0.085 0.263 0.39 

Growth Model 0.013 208 130 43 0.22 1.11 129. 0.087 0.36 0.20 

II. CENTRALITY FINGERPRINTS 

We have remarked that the appropriate choice of centrality measure is critical to the 
success of the growth model. Many of the most common centralities can be written^ in the 
form CCr=o A n /n) 11), where |1) is the column vector with all entries equal to 1, and f n is a 
function that assigns weights to different powers of the network’s adjacency matrix A. To 
get insight into this formula, note that (A n )ij is equal to the number of paths of length n 
from vertex i to j. (Here the parallel lines that make up a multiedge contribute to distinct 
paths.) The % component of the vector A n |1) is then the total number of paths of length n 
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FIG. 2. Centrality fingerprints for three -Ehi ss -based growth models. The centralities shown 
are eigenvector (light blue), exponential (gold), normalized exponential (dark blue), A 2 (purple), 
degree (green), and “Ping-Pong” (red-purple). The preceding list is in descending order of x 2 for 
500 lines in (a). Part (b) features a null-hypothesis result, where the growth-driving currents in 
the definition of x 2 are replaced by randomly chosen currents. The centralities of primary interest 
have x 2 values that stabilize near 1, the value for uncorrelated random currents. 

starting on vertex i. Onr assumption is that an “important” vertex will be the endpoint of 
many paths. Furthermore, we wish to weight short paths more than long, winding paths; 
this can be accomplished by choosing a function f n that falls off appropriately fast. 

Here we consider five centrality measures of this form. The simplest is degree centrality , 
proportional to the number of lines incident on each vertex and calculated from A|l). (We 
also consider A 2 |l).) The eigenvector centrality , proportional to the principal eigenvector of 
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A, is obtained from letting f n — 0 for all n ^ n' while n! — )■ oo. The exponential centrality , 
with adjustable parameter T, is given by f n = l/(rdT n ). Note that, after many growth 
steps, A will be large and higher-power terms will dominate, whence we recover eigenvector 
centrality. To address this issue, we also define the normalized exponential centrality with 
L = (1 /n\{TM) n ). 

Recall that we have already employed centrality measures to estimate the (negative) 
growth-driving currents. Here, we use them for a second purpose: to probe the extent to 
which the growth model seeks to reproduce the growth-driving currents. This can be quan¬ 
tified by a normalized y 2 measure equal to Y2 v e vertices(current(u) — centrality(n)) 2 . (We 
have chosen the normalizing factor Nf 1 so that the maximum possible y 2 is approximately 
100, while y 2 = 1 is obtained as the average value for uncorrelated random in/out currents 
satisfying our normalization requirements.) The y 2 can be calculated for every step in the 
growth to create a dynamical picture of the growth process. Of course, we do not expect y 2 
to have the same tendencies for every centrality. We propose that comparing many different 
y 2 s for a given growth model can give us (i) insight into the appropriateness of the central¬ 
ity measure and (ii) a unique illustration of the growth model’s behavior. We call such a 
comparison a “centrality fingerprint,” and display examples in Fig. [2] for the three growth 
model variants previously discussed. 

Comparing Figs. [5b, Eh, and Etl, we note the marked qualitative differences in the y 2 
curves. Though in each plot, the un-normalized exponential centrality is rising to meet the 
eigenvector centrality, the normalized exponential (along with the A 2 centrality) stabilizes 
only in the Cost/Benefit model. Furthermore, only the Cost/Benefit model admits y 2 s 
that continually fall as the network is grown. This suggests that the Cost/Benefit model 
asymptotically matches these centralities to the growth-driving currents, and thus that those 
centralities are closely related to that growth model. 

The lowest y 2 obtained for the Cost/Benefit model has not yet been introduced. This is 
the “Ping-Pong” centrality, which we developed to capture the relationship between sources 
and sinks. This centrality marks generators as important if they have many paths to loads 
and vice versa. This is accomplished by simulating many random walks on the network, 
alternating between absorbing on generators and loads. The Ping-Pong centrality can be 
expressed as the principal eigenvalue of a certain walk matrix, a form which allows us to 
prove that it reduces to the degree centrality in bipartite networks; this is echoed in the 
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similarity of the y 1 2 3 4 5 for degree and Ping-Pong centralities in Figs. [2^ and[2]3. 

Finally, we note that in Figs. [2£i and|2t we see minima in certain y; 2 values at a number 
of lines (~ 200) corresponding to the number of lines in the real Florida grid. Any of these 
minima could be plausibly identified as stopping points for the network growth model. 

In the preceding considerations, we have compared growth models using the real Florida 
geography and generator capacities. It is reasonable to expect that varying these conditions 
significantly may lead to different behavior. For example, the power grid of the state of 
Maryland[9] has different network characteristics than FLG, including a significantly lower 
clustering coefficient. Our Cost/Benefit model run with the Maryland data creates a cen¬ 
trality fingerprint very different from that in Fig. [2]a.. A possible explanation is that none of 
the centralities considered here provide a good estimate for the Maryland generating capac¬ 
ities, leading to an invalid extrapolation to the load power demands. We leave it for future 
research to find appropriate centrality measures for the Maryland and other grids. 
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